Federated Ensemble Model-Based Reinforcement Learning in Edge Computing
نویسندگان
چکیده
Federated learning (FL) is a privacy-preserving distributed machine paradigm that enables collaborative training among geographically and heterogeneous devices without gathering their data. Extending FL beyond the supervised models, federated reinforcement (FRL) was proposed to handle sequential decision-making problems in edge computing systems. However, existing FRL algorithms directly combine model-free RL with FL, thus often leading high sample complexity lacking theoretical guarantees. To address challenges, we propose novel algorithm effectively incorporates model-based ensemble knowledge distillation into for first time. Specifically, utilise create an of dynamics models clients, then train policy by solely using model interacting environment. Furthermore, theoretically prove monotonic improvement guaranteed. The extensive experimental results demonstrate our obtains much higher efficiency compared classic challenging continuous control benchmark environments under settings. also highlight significant impact client data local update steps on performance FRL, validating insights obtained from analysis.
منابع مشابه
Performance Optimization in Mobile-Edge Computing via Deep Reinforcement Learning
To improve the quality of computation experience for mobile devices, mobile-edge computing (MEC) is emerging as a promising paradigm by providing computing capabilities within radio access networks in close proximity. Nevertheless, the design of computation offloading policies for a MEC system remains challenging. Specifically, whether to execute an arriving computation task at local mobile dev...
متن کاملReinforcement Learning: Model-based
Reinforcement learning (RL) refers to a wide range of dierent learning algorithms for improving a behavioral policy on the basis of numerical reward signals that serve as feedback. In its basic form, reinforcement learning bears striking resemblance to ‘operant conditioning’ in psychology and animal learning: actions that are rewarded tend to occur more frequently; actions that are punished ar...
متن کاملModel-Based Reinforcement Learning
Reinforcement Learning (RL) refers to learning to behave optimally in a stochastic environment by taking actions and receiving rewards [1]. The environment is assumed Markovian in that there is a fixed probability of the next state given the current state and the agent’s action. The agent also receives an immediate reward based on the current state and the action. Models of the next-state distr...
متن کاملEnsemble Pruning Using Reinforcement Learning
Multiple Classifier systems have been developed in order to improve classification accuracy using methodologies for effective classifier combination. Classical approaches use heuristics, statistical tests, or a meta-learning level in order to find out the optimal combination function. We study this problem from a Reinforcement Learning perspective. In our modeling, an agent tries to learn the b...
متن کاملMultiple Model-Based Reinforcement Learning
We propose a modular reinforcement learning architecture for nonlinear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic idea is to decompose a complex task into multiple domains in space and time based on the predictability of the environmental dynamics. The system is composed of multiple modules, each of which consists of a state predict...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Parallel and Distributed Systems
سال: 2023
ISSN: ['1045-9219', '1558-2183', '2161-9883']
DOI: https://doi.org/10.1109/tpds.2023.3264480